An Evaluation of TRACA's Generalisation Performance
Abstract
TRACA (Temporal Reinforcement learning and Classification Architecture) is a learning system intended for robot-navigation problems (Mitchell 2000). One problem in this area is the input-generalisation problem. Input generalisation requires learning a small set of internal states which represent useful abstractions of the much larger set of actual states. As such, the input-generalisation problem is fundamentally similar to the classical problems of classification, concept learning and discrimination. However, on-line robot-learning tasks are evaluated by different criteria from batch classification tasks. Specifically, a small number of trials is desirable to reduce the risk of damage to the agent and/or its environment. This may come at the cost of more computation during learning and slightly lower predictive accuracy. Other requirements are the ability to learn on-line without predefined classes (i.e. classes must be learned during training), an efficient, adaptable representation and minimal parameter tuning. This paper describes TRACA's generalisation mechanism in detail and evaluates its performance on a number of common classification tasks. The ability of TRACA to use short-term memory to represent hidden state is ignored in this comparison because, in all the following tasks, perceptual aliasing can be overcome by including additional features. On most tasks, TRACA's predictive accuracy is within a few percent of the best-performing systems compared, and TRACA's result is often achieved with less training experience. The experiments also support claims by Holte (1993) that a high predictive accuracy (above 90 percent in these experiments) can easily be achieved on many well-known classification tasks which are often used for evaluating learning systems.